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Abstract: We study pairs and m-tuples of compositions of a positive integer n with parts restricted to a subset V of 
positive integers. We obtain some exact enumeration results for the number of tuples of such compositions having the same 
number of parts. Under the uniform probability model, we obtain the asymptotics for the probability that two or, more 
.generally, m randomly and independently chosen compositions of n have the same number of parts. For a large class of 
icompositions, we show how a nice interplay between complex analysis and probability theory allows to get full asymptotics 
f—sfor this probability. Our results extend an earlier work of Bona and Knopfmacher. While we restrict our attention to 
^^compositions, our approach is also of interest for tuples of other combinatorial structures having the same number of parts. 

a 
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1 Introduction 

n this note, we study tuples of compositions of positive integers having the same number of parts, and the asymptotics of 
^-n~elated generating functions satisfying some differential equations. Let us recall that a composition of a positive integer n 

cf s any fc-tuple & > 1> °f positive integers that sum up to n. The Kj's are called the parts (or summands) of a 

'^Composition. It is elementary and well-known (see, e.g. [T]) that there are (£Zi) compositions of n with k parts, and thus 

^^there are 2™~~ 1 compositions of n. By restricted compositions we mean compositions whose parts are confined to be in a 
i__fixed subset V of N. 

The main motivation for this work is a recent paper [BJ in which the authors studied pairs of compositions with the 
' ^same number of parts. Our extension of this work is directly connected to the question of obtaining the asymptotics of 
i^^Doefficients of functions satisfying a linear differential equation which, despite the deep work by Fabry, Frobenius, Fuchs, 
^^Picard and other analysts more than one century ago, remains open and is conjectured to be undecidable. We present 
i-Hhere a new way to use probability theory in addition to complex analysis in order to solve this problem for a large class of 
^Ofunctions. 

y—( In their paper [BJ, Bona and Knopfmacher studied the asymptotic probability that two randomly and independently 
^D^hosen compositions of n have the same number of parts. Furthermore, relying on the generating function approach, for 
few specific subsets V they addressed the same question for pairs of restricted compositions. In each of these cases this 
' . probability is asymptotic to C/s/ri with C depending on V . Our main aim here is to extend these results. First, we 

i^show that this asymptotics is universal. That is, we show that for an arbitrary subset V containing two relatively prime 
k^elements the probability that two independently chosen random compositions of n with parts restricted to V have the same 

j_jnumber of parts is asymptotic to Cjyfn. The value of C depends, generally, on V and is explicit. (See our Theorem 



5.1 



Cdand subsequent remarks, which include e.g. a correction of a constant appearing in [BJ.) Secondly, we consider the same 
question for m > 2 and we show that in this case the sought probability is asymptotic to C/Vn" 1 ^ 1 for an explicitly given 



constant C whose value depends on V and m only. (See our Theorem 5.3 ) 

Bona and Knopfmacher's approach relied on complex analysis; the universality of using a more probabilistic technique 
was then noticed by Bona and Flajolet [5], where certain types of random trees were studied. Our approach is in one sense 
a mixture of complex analysis (which gives the full asymptotics expansion, up to a multiplicative constant, and with the 
price of heavy computations), and probability theory (a local limit theorem which gives without any heavy computation 
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the first asymptotic term, and therefore gives access to the multiplicative constant, but intrinsically no access to further 
asymptotic terms). Bona and Flajolet obtained, in particular, a general statement indicating how local limit theorem can 
help in evaluating probabilities that two independently chosen random structures of the same size have the same number 
of components (this is their Lemma 6 in [5], which corresponds to our Lemma 5.2 for Gaussian density with a slightly 
different proof. Our Lemma 5.2 was obtained independently, but later). As we will see, these statements remain true if one 
considers more than two random structures. 

In Section 2, we present our model. We proceed in Section 3 with some examples (and en passant, some nice questions 
in computer algebra) and argue on the intrinsic limitations of an approach relying only on complex analysis. This serves as 
a motivation for introducing the local limit law result in Section 4, which finds application in Section 5, thus solving the 
initial problem of the asymptotic evaluation of the probability that tuples of compositions have the same number of parts. 
We conclude with some perspectives in Section 6. 



2 Generating functions for pairs of compositions having the same number 
of parts 

Let us consider compositions with parts in a set V (a fixed subset of N). To avoid trivial complications caused by the fact 
that there may be no compositions of a given n with all parts from V, we assume that V has at least two elements that are 
relatively prime (except when explicitly stated otherwise). 

We introduce the generating function of the parts p(z) = X^eP Pj zJ > (Pj ^ s n0 ^ necessarily or 1, it can then be 
seen as the possible colors or the weight of part j). We thus assume that the pj's are non-negative real numbers such 
that ^2j£-pPj > I- This last condition is to ensure supercriticality of our scheme (see Section |4j below for more details). In 
the classical situation when pj is or 1, this condition holds automatically. Denote by 

p M = £ ^ n = T^) CD 

the bivariate generating function of compositions of n where k encodes its number of parts, and where the "size" of the 
composition is n. 

With a slight abuse of notation, the corresponding univariate generating function is 

P( Z ) = J2PnZ n = -\-. (2) 

Z — ' 1 — p z 



n>0 



This terminology is classical. For example, here are all the compositions of 5 with 3 parts from the set V = {1, 2, 3, 4, 10}: 
5 = 1 + 1 + 3 = 1 + 3 + 1 = 3 + 1 + 1 = 1 + 2 + 2 = 2 + 1 + 2 = 2 + 2 + 1. Accordingly, P 5:3 = 6. 

Let be the random variable giving the number of parts in a random composition of n with parts belonging to V '. 
Random means that we consider the uniform distribution among all compositions of n with parts belonging to V. 

Given two subsets V\ and T>2 of N, we consider the probability 7r„ := Pr(A^' 1 = X^ 2 ) that a random composition 
of n with parts in Vi has the same number of parts as a random composition with parts in T > 2- We assume throughout 
that, whenever two such compositions are chosen, they are chosen independently and from now on we will not be explicitly 
mentioning it. We then introduce the generating function D(z) of the number of pairs of compositions (the first one with 
parts in Pi, the second one with parts in V2) having the same size and the same number of parts. {D stands for "double" 
or "diagonal", as D(z) can be obtained as a diagonal of multivariate function.) 

That is, we consider all fc-tuples of elements of V\ and all fc-tuples of elements of Vi such that their sum is n. For a 
fixed n, let D n be the total number of such configurations (i.e., we sum over all fc). 

In the next section, we deal with some interesting examples for which we get explicit formulas. 



3 Some closed-form formulas 

3.1 An example on tuples of domino tilings 

Consider the classical combinatorial problem of tiling a 2 x n strip by dominoes. Any tiling is thus a sequence of either one 
horizontal domino or 2 vertical dominoes. The generating function of domino tilings is thus P(z) = Seq(z + z 2 ) = 1 _ 2 , 
which is the generating function of P n = F n+ \, where F n is the Fibonacci number F n — F n -i + F n -2, Fq = 0, F\ = 1. 
(Equivalently, the Fibonacci recurrence reflects the fact that removing a horizontal domino on the top of an existing 2 x n 
tiling leads to a 2 x (n — 1) tiling, while removing 2 vertical dominoes on the top leads to a 2 x (n — 2) tiling.) Let us now 
consider a less trivial question, which is archetypal of the problem we consider in this article (note that it has a closed-form 
solution but we will address later in this article similar problems having no such nice closed-form solution): 
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Puzzle 3.1 Each of m children makes a tiling of a 2 x n strip. What is the probability ir n that these m tilings all have the 
same number of vertical dominoes, when n gets large ? 

For m = 2, the number of pairs is given by D(z) = [t°] *}2 k> QP k {zt)p k (l/t), where p(z) = z + z 2 , and the Cauchy formula 
gives 

1 / 1 dt 1 f Num(z,t) , , , 

D(z) = <f> . . , , N — = i> ^rdt, 3 

w 2in J l-p(zt)p(l/t) t 2iir J Den(z,t) y ' 

where Num and Den are polynomials in z,t. Let Z(z) be any root of Den, i.e. Den(z, Z) = 0, such that Z is inside the 
contour of integration for z ~ 0. Then, a residue computation gives: 

D , s = Num{z,Z) = 1 

^ d t Den{z, Z) V^ 4 - 2z 3 - z 2 - 2z + 1 

= 1 + z + 2z 2 + 5z 3 + llz 4 + 26z 5 + 63z 6 + 153z 7 + 376z 8 + 931z 9 + 0(z 10 ) . 

This is the sequence A051286 from [T2] D n = £Lo (% k ) > B6na and Knopfmacher 6J gives a bijective proof that it is also 
the Whitney number of level n of the lattice of the ideals of the fence of order 2n. The probability that 2 tilings of a 2 x n 
strip have the same number of vertical dominoes is therefore (via singularity analysis, which can be done automatically with 
some computer algebra systems, e.g. via the equivalent command of Bruno Salvy, in the Algolib Maple package available 
at 



http : //algo . inria.fr/libraries 



..-D.,*-^****^^)."^. (4, 
2- v /7iyn V 71 "" 7 Vn 5 V n 

Note that this is consistent with the constant C given in Equation (2.10) in 6 . Our computations are available on-line in 
a Maple session^] Note that as Maple does not always simplify algebraic numbers like humans would do (some denesting 
options are missing), we used here some of our own denesting recipes so that these nested radicals become more readable 
for human eyes. 

For m = 3, it is possible to compute the diagonal D(z) via creative telescoping (as automated in Maple via the MGfun 
package of Frederic Chyzak or in Mathematica via the package HolonomicFunctions of Christoph Koutschan) . This leads 
to the following differential equation: 

= (4z 7 + 7z 6 + 7z 5 + 15z 4 + 41z 2 + z + l) D (z) 

+ (5z 8 + 12z 7 + 7z 6 + 62z 5 + 88z 3 + z 2 + 6z - l) ^-D (z) 
v ' dz 

d 2 

+z (z 2 + 1) (z 4 - z 3 + 5z 2 + z + 1) (z 2 + 48-1) -r^D (z) . 

Here, the so-called Frobenius method gives the basis of the vector space of solutions of this ODE, under the form of 
local formal solutions around any singularity, by using the associated indicial polynomial (see Chapter VII.9]). In full 
generality, the dominati ng si ngularity of D(z) is z = p rn , where p is the radius of convergence of 1/(1 — P(z)); this can be 



proven via our Theorem 5.3 In our case, the Frobenius method gives around the dominating singularity £ := %/5 — 2: 



DM - A,( 8( ^,n( = -0 + ^ln(,-C) + O((»-0 2 )) 

for some unknown coefficients Ai,A2 (related to the so-called Stokes constants or connection constants)]^] However, only 
the first summand contributes to the asymptotics of D n and a numerical scheme of our own allows to determine (with the 
help of the heuristic LLL algorithm) the value of Ai. Using singularity analysis then leads to 

5\/l5 1 5(1073-^15) 1 . 1.027340740 

tt„ + ^ --+0 1n 3 « . 

D 7rn 54 "Kn^ n 



http : //www-lipn.univ-parisl3 . f r/~banderier/Pawel/Maple/ 



iSee 

2 Notc that, as typical with the frobenius method (or also with the Birkhoff-Tritjinski method, see 1141 ), it is not always possible to decide 
the connection constant(s); in the next sections, we give a rigorous probabilistic approach which allows to get this constant, and therefore full 
asymptotics by coupling it with the Frobenius method! 
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This asymptotics also proves that F)(z) is not an algebraic function (the local basis of the differential equation involves 
a logarithmic term). 

For m = 4, creative telescoping leads to the following differential equation: 

2 (l32s 16 - 3563s 15 4- ■ ■ ■ + llo) D (a) 4- 2 (209474z 14 4- ■ - - - 1581a) D (a) 

d 2 d 3 

+ (704z 18 -| ■ - 10143s 2 ') D(z) + a (l65z 18 + • • • - EE*) U (a) 

v ' dz 2 v ' dz 3 

d i 

+ z 2 (z - 1) (a + 1) (z 2 + z + l) (z 2 - 7z + l) (z 2 - z + l) (z 4 + . . . + l) (llz 6 + . . . + ll) — D (a) . 

Using the Frobenius method and a numerical scheme of ours, this leads to 

25 5 X ^V2 , 5 5 1 / 4 % /2(47\/5- 240) iQ 1 N _ 1.186814138 



8 %/ Trn 3 256 V 7r 3 ra 5 V / tt7 a/fi 3 

It is noteworthy that this asymptotics is compatible with the fact that D{z) could be an algebraic function. However, a guess 
based on Pade approximants fails to find any algebraic equation. What is more, the index of nilpotence mod 2,3,5,7, 11 
of D(z) is 3 (i.e. the smallest i such that (d/dz) 1 = L mod p is i = 3 for primes p = 2,3, 5, 7, 11... and L is the above 
irreducible unreadable linear differential operator cancelling D(z)). Therefore, according to a conjecture of Grothendieck 
on the p-curvature (see [7]), the function is not algebraic. 

For m = 5, D(z) is a non algebraic function satisfying a differential equation of order 6 and of degree 38, which leads to 

25V5 1 1.416006588 



The closed form of the coefficients is D n {m) = Y^k=o \ k ) ' as can a ^ so ^ e obtained via a bijective proof approach. It 



is possible to get their asymptotics via the Laplace method or our Theorem 5.3 this leads to n n ~ C m / \J {■nn) m ~ 1 with 
C m = (5 3 / 4 ) m_1 /V2 m_1 m. This allows us to give a proof of the following claim (which was a conjecture by Paul D. Hanna, 
see [H A181545]): 

Proposition 3.1 D n+ i(m) / D n (m) ~ (F m ^/5 + L m )/2, where L m are the Lucas numbers, defined by the same recurrence 
as the Fibonacci numbers F m , but with different initial conditions, namely Lq = 2 and L\ = 1. 



Proof. 

D n+1 (m) = n n+1 (m)P™ +1 = C^v^n+I)P? +1 
D n (m) n n (m)P™ ^/VH" 1 " 1 ^ 



P, 



n+l 



n - 1 y VP, 

1 + (1/2 - m/2)- + O(^)) f- (1 + 0(e n ))) " ~ 

where p = p _1 (l) and the asymptotics for P„ is explained in detail in the next section (Equation [6| . In the case of 
p(z) = z + z 2 , the claim then follows from p = l/(j> and the exact formula <ft m = (F m y/b + L m )/2. □ 

Note that for all odd values of m > 2, the presence of an integer power of ir in the asymptotics of D n (m) implies that 
the function D(z) can not be algebraic, whereas for all even values of m > 2, the asymptotics match the patterns appearing 
in the asymptotics of coefficients of algebraic functions. However, we expect the following conjectures to be true. 

Conjecture 3.1 For any rational function p{z) € N(z) (with p(l) > 1), the generating function D(z) is not algebraic for 
m > 2. 

It includes the specific case D(z) = J2 n>0 D n z n with D n = Ysk=o ("fc^™ ( non algebraicity of our initial puzzle) or 
D n = X)L-=o (fe)™ ( non algebraicity of Franel numbers of order m). Nota bene: We gave here several ways to prove the 
non-algebraicity for some value of m, and we proved it for all odd m > 2, we are however unaware of any way of proving 
this for all even m > 2 at once, except, in some cases, an evaluation at some z leading to a transcendental number, or the 
Christol-Kamae-Mendes-France-Rauzy theorem on automatic sequences. 

Definition 3.2 (Closed-form sequence) A sequence of integers D n is said to have a closed-form expression if it can be 
expressed as nested sums of hypergeometric terms, with natural boundaries (i.e. the intervals of summation are and n). 
N.b: the number of nested sums has to be independent of n. 

Typical examples of closed-form expression are nested sums of binomials; without loss (or win!) of generality, it is 
possible to allow more general intervals of summation or internal summands. 
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Conjecture 3.2 Let D(z) be like in equation^ below (for any rational functions Pi(z) £ N(z)), then its coefficients D n can 
be expressed in closed form. 

An effective way of finding this nested sum could be called a " reverse Zeilberger algorithm" . It then makes sense to give 
the following broader conjecture: 

Conjecture 3.3 The coefficients of any D-finite function (i.e. a solution of a linear differential equation with polynomial 
coefficients ) can be expressed in closed form. 

Note that it follows from the theory of G-series that this does not hold for closed- forms of the type "one sum of hypergeo- 
metric terms" [TO]- The formulas we will give in the rest of this section are somehow illustrating these conjectures. 

3.2 Other nice explicit formulas 

It is clear from the previous subsection that we could play the same game for any m-tuple of compositions with parts 
restricted to m different sets, encoded by pi(z), . . . 7 p m (z). 

Proposition 3.3 The generating function for the number of m-tuples of compositions having the same number of parts is 
given by 

D(z s _ 1 f 1 dt2 a%n 

[Z >~ (2i^)'"- 1 7 l-pi{zt 2 ...t m )p 2 {l/t 2 )...p m {\/t m ) t 2 "' t m ' U 

Therefore, one should not expect any nice closed-form solution for D(z) whenever m > 2; while for m = 2, whenever all 
the pi(z)'s are polynomials or rational functions, D(z) will be an algebraic function whose coefficients can be expressed by 
nested sums of binomial coefficients (using Lagrange inversion). 

For example, if Pi(z) = p 2 (z) — 2z + z 2 (which can be considered as tilings with bicolored horizontal dominoes), one 

get Sj D„ +1 (2)=ELoE-=o(-)_(^')- 

If Pi (z) = (i.e., we consider compositions with any parts), then D n is the sequence of Franel numbers of order m: 

D n+ i(m) = X)fc=o (fc)" 1 ' anc ^ we wm see m Section [H] that the probability that m unrestricted compositions of n have the 
same number of parts is thus 7r„ ~ C m j \J (7rn) m_1 , with C m — y // 2 m ~ 1 /m. 

Note that if we replace pi(z) (for i > 1) by (1 + Pi(z)) in the integral formula of Proposition 3.3 then this gives the 
generating function of tuples of compositions such that the number of parts is in decreasing order. 

Let us add a few examples for which parts are in two different sets V\ and V 2 . If Pi(z) — z + z 2 and p 2 (z) — z + 2z 2 , 
then one gets an interesting case as we have here 

V72 + 42^/3(^5 - 5)(V2 - 2) ( 1 - y/2 - y/E + Vw\ " , co .95" 

TTn ~ 1= F= ~ l-02 . z , 

12^¥n y 2(2 -a/3) J v 7 ™ 

which is therefore exponentially smaller that the order of magnitude of our previous examples. We will comment later on 
this fact. 

Going to a slightly more general case Pi{z) = ouz + fiiZ 2 , one has for m = 2: 

D(z) = 1 = . 

y/l - 2a ia2 z + (a 2 a 2 - 2(3 1 (i 2 )z 2 - 2a 1 a 2 f3 1 f3 2 z 3 + /3 2 /3 2 z 4 

Therefore the generating function only depends on the products ct\a 2 and fiif3 2 . This implies e.g. that p±(z) = 2z + 3z 2 
and p 2 (z) = 3z + 5z 2 will lead to the same D(z) as pi(z) = 6z + z 2 and p 2 {z) = z + 15z 2 . 
Note that D{z) factors nicely when fix = f3 2 = 1: 

D(z) = \jyj{aia 2 z - 1 - 2z - z 2 )(a ia2 z - 1 + 2z - z 2 ) . 

If, additionally, a±a 2 — ±4 this gives the sum of central Delannoy numbers [1]: 

D(z) = 1/(1 -z)x l/y/l + (2+\ ai a 2 \)z + z 2 . 

When the allowed parts are only a and 6, i.e. Pi(z) — aiZ a + 0iZ b , then all the compositions in the constrained tuples 
have necessarily the same number of parts "a" (this also holds for the number of parts "6"). Choosing the order of the n\ 
parts "a" and the n 2 parts "6" leads to the formula 

D n {m)= [n™ 2 ) (<* 1 ...a m ) ni (fii...p m ) na . 

7ii a.+ri2 b—n ^ * 



5 



There is no longer such a simple formula as soon as one has more than two allowed parts, because the parts can then 
compensate each other in many ways, e.g., assume that the allowed parts contain 3 integers < a < b < c, then one can 
always create a composition V\ having n% " a" , n 2 " b" , n% " c" and a composition V 2 having m\ " a" , m 2 " b" and 7713 " c" 
such they have the same number of parts ni + n 2 + U3 = mi + m,2 + m3, but (m, n 2 , U3) 7^ (mi, TO2, TO3). To achieve this, 
consider m = c — 6, n.3 = 6 — a, 712 = ni + H3, m,2 = 0, mi = 2ni, m3 = 2ri3, thus one gets two different compositions of n: 
n = nia + ri2^ + "-3c = mi a + 777,26 + 7773c. 

If pi(z) — az + j3z 2 and P2{z) = z 2 /(l — z 2 ), then D 2n — fl n , while if pi(z) — az + [3z 2 and p 2 (z) = z/(l — z), then 

. . 1 1 1 + az 

D(z) = — I , . 

2 2 ^l-2az + z 2 {a 2 -4,3) 



So, a nice surprise is given bypi(z) = z+z 2 and p 2 (z) — z/(l—z) , for which we get D(z) = 1/2 + 1/2 y 3^3^: which is known 

to be the generating function of directed animals [HI A005773]. This sequence also counts numerous other combinatorial 
structures: variants of Dyck paths, pattern avoiding permutations, base 3 n-digit numbers with digit sum n, ... It also 
counts prefixes of Motzkin paths and this leads to an alternative formula D n+ \ = M n = 3™ — 2fe=o 3"~' c ~ 1 £ , fc, where M n 
and E n stand for meanders and excursions of length n, following the definitions and notations from [2]. 

We leave to the reader the pleasure of finding a bijective proof of all of this. (Some of them go via a bijection with 
lattice paths, as done in [6], and then via the bijection between heaps of pieces and directed animals, see Fig. 1.) Note that 
some of the bijections can lead to efficient uniform random generation algorithms. 



Figure 1: Pairs of composi- 

tions having the same number of 
parts (e.g. (1,2,2,1,2,1,2,2,2,2,2) and 
(3,1,1,1,3,2,1,1,4,1,1)) are in bijection 
with several combinatorial objects, 
e.g. lattice paths (left) and directed 
animals (right). 

In summary, it may seem possible to compute everything in all cases; however, for a generic V , in order to compute 
the constant C m involved in 7r n ~ C m , — - , we need heavier computations if the degrees of the pi(z)'s get large or 

if m is large. Current state of the art algorithms will take more than one day for m = 6, and gigabytes of memory, so 
this "computer algebra" approach (may it be via guessing or via holonomy theory) has some intrinsic limitations. What is 
more, for a given V , it remains a nice challenge to get a rigorous (Zeilbergerian computer algebra) proof for all m at once. 

In the next sections, we show that the technical conditions to get a local limit law hold, and that this allows to get the 
constant C, for any P, for all m. 

We conclude this section with Tables 1 and 2 (see next page) summarizing our main closed-form formulas. 




4 Local limit theorem for the number of parts in restricted compositions 

The discussion in this section pretty much gathers what has been developed in various parts of the compendium on Analytic 
Combinatorics by Flajolet & Sedgewick [9]. 

Our main generating function (see Equation (Jl|) is a particular case of a more general compositiorj^] scheme considered in 
Flajolet and Sedgewick, namely F(z, u) — g(uh(z)). In our case g(y) = 1/(1 — y) and h(z) = p(z). According to terminology 
used in Definition IX. 2, p. 629, Sec. IX. 3], under our assumption that J^jevPj > •"■ ^ ne scheme is supercritical (i.e., when 
z increases, one meets the singularity y — 1 of g(y) before any other potential singularity of p{z)). As a consequence, the 
number of parts is asymptotically normal as n — > 00, with both the mean and the variance linear in n. We now briefly 
recapitulate the statements from [5]. The equation p(z) = 1 has a unique positive root p € (0, 1). As a consequence, F(z, 1) 
has a dominant simple pole as its singularity and thus the number P n of compositions of n with all parts in V is 

[z"]F(z,l)~-^-p- n (l + 0(s n )), (6) 

where e is a positive number less than 1, see [HI Theorem V.l, p. 294]. The probability generating function of is given 

by 

[z n ]F(z,u) 



3 We cannot escape this polysemy: Compositions are enumerated by a composition! 
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allowed 
parts in Vi 


number of tuples (Vi, . . . ,V m ) 
of compositions of n having 
the same number of parts 


Sloane's On-line encyclopedia of integer sequences 
alternative description 




n/b / n + k(a-b) s 

D n (m) = J2( I 

k=0 \ , 


m 

J (oi. 




n - kb)/a (p 1 ...l3 m ) k 


T'Vip notation n a ^ mpfinQ tnat thp nart n i^i ponQirlpvpn 1 

with weight (or multiplicity) on in the compositions of Pi. 
(Binomials with fractional entries are considered as zero). 


{1,2} 


D n (m) 


= M 


n, — k \ 
k ) 


m 


m — 2, A051286: Whitney number of level n of the lattice 
of the ideals of the fence of order 2n. 


{1,1,2} 


ft=oj=o V/ \ 


r) 


= 4 «£( n ^Y/16* 

fe=0 \ / 


Tn = 2, A089165: partial sums of the central Delannoy numbers, 
resistance between two nodes of an infinite lattice of unit resistors, 
# of peaks of odd level in Schroeder path. 


dN 


n 

D d n(m) = 

fc=0 




Franel numbers 

( m — 2 simplifies to the central binomial numbers ( 2 j ^ 1 ) , A000984, 
m = 3,4,5,6: A000172, A005260 , A005261, A069865). 


2N- 1 


f„ + i(m) 


n 

-E 

fc = 


' n — k 
V k , 


r 


Same as pairs of compositions of n — 1 with parts in {1,2} 
(m = 2, 3, 4, 5: A051286, A181545, A181546, A181547). 


{n>l} 


D n+2 (m) 


n 

-E 

fc=0 


1 n — k 

K k , 


m 





Table 1: Summary of the main closed- form formulas (for any m) of our Section 3. 



allowed 
parts 
in Ti 


allowed 
parts 
in V2 


number of pairs (Pi ,T , 2) of 
compositions of n having 
the same number of parts 


OEIS 

reference 


alternative 
OEIS description [T2] 


{1,2} 


{1,2,2} 


fe=0 \ / 


A108488 


Expansion of 1/vT — 2a; — 3a; 3 — 4a; 3 + 4a; 4 . 


N 


NU{0} 


n 

E 

fc=0 




A047781 


Convolution of central Delannoy numbers 
and Schroeder numbers. 


{1,2} 


{1,1,2} 




A101500 


Chebyshev transform of the central 
binomial numbers (the formulas 








in this OEIS entry are not correct). 


N 


dN 






A045721 


Some coefficients are 0, as gcd(parts) 7^ 1. 

For d = 2, related to 
lattice paths, trees, standard tableaux... 


{1",2^} 


2N 




A000079 
A000244 
A000302... 


The notation "2 /3 " means that the part 2 
comes with a weight (or multiplicity) p. 


{1,2} 


N 






A005773 


directed animals (and numerous avatars 
of Motzkin paths, constrained matrices...). 



Table 2: Summary of the main closed-form formulas (for m — 2) of our Section 3. 
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In a sufficiently small neighborhood of u — I, as a function of z, F(z, u) given in ([I]) has a dominant singularity p{u) which 
is the unique positive solution of the equation 

up(p(u)) = 1. 

Consequently, 

[z"]f(z, M ) 1/0,(1))/^-"-' 
Jnl j [z«]F(z,l) ^ 

It follows from the analysis of supercritical sequences given in [9, Proposition IX. 7, p. 652] that the number of parts X„ 
satisfies 

d 



vV jp vV 



N(0,1) 



where N(0, 1) denotes a standard normal random variable whose distribution function is given by 



*(a;) = -L [ X e~ t2 ' 2 dt, 

V27T J-oo 



and where the symbol " — >" denotes the convergence in distribution. The asymptotic expressions for the expected value 
and the variance of X„ are given by 

EXl = —^r-r + 0(1) with p e (0, 1) such that pip) = 1, (7) 
PVKP) 

var(JC)=#n + 0(l) whe re K = PP " {p) + P } P ] . (8) 

[Note that the expression for the coefficient K of the variance given in Proposition IX. 7 in [9 is incorrect; the correct 
version is (ph"(p) + h'(p) — ph! \p) 2 ) / \pb! (p) 3 ) , as given in many other places in the book.] 

We now note that the central limit theorem can actually be strengthened to the local limit theorem, pretty much as 
discussed in [9l Theorem IX. 14 and the remarks following its proof on p. 697]. Let us recall the following notion. 

Definition 4.1 Let (X n ) be a sequence of integer valued random variables with EX n — p n and vax(X n ) = a 2 . Let (e„) be 
a sequence of positive numbers going to 0. We say that (X n ) satisfies a local limit theorem (of Gaussian type) with speed e n 
if 

a n Pr(X„ = [p n + xa n \ ) 



sup 



V2ir 



<S r , 



As was discussed in (9j p. 697], to see that the local limit theorem holds for restricted compositions, it suffices to check 
that p(u) when restricted to the unit circle uniquely attains its minimum^] at u = 1. This is what we prove in the following 
lemma. 

Lemma 4.2 Let p be a power series with nonnegative coefficients, of radius of convergence p p > (possibly p p — +oo). 
Let p(u) be as above the positive rooQ of up{p{u)) = 1. If p is aperiodi^^ then for < < p p and t G]0, 2tt[, we have 



p(R) < \p(Re 



i. e. the minimum on each circle is on the positive real axis. In particular, if the radius of convergence of p is larger than 
1, then for \u\ = 1 and u ^ 1 we have 

p{l) < \p(u)\. 

Proof. First, p has nonnegative real coefficients, therefore the triangle inequality gives p(\p(u)\) > \p(p(u))\. Equality can 
hold only if p{p{u)) has just nonnegative terms, but this is not possible if p(u) ^ M + as p is aperiodic with nonnegative 
coefficients. Hence one has a strict triangle inequality: p(\p(u)\) > = \l/u\ — 1/R (the middle equality is just the 

definition of p and the last equality comes from the fact we are on the circle |u| = R). As p is increasing on [0, 1/-R], we 
can apply p~ x to p(\p(u)\) > 1/R which gives p^ 1 (p(\p(u)\) > p~ 1 (l/R), that is \p(u)\ > p(R)- □ 

Note that the aperiodicity condition is important, e.g. for p(z) = z 2 + z 6 (i.e. p[u) is the radius of convergence 
of P(z,u) = 1/(1 — u(z 2 + z 6 ))), one has p(— 1) = ip(l); however some periodic cases have a unique minimum on the unit 
circle, e.g. p(z) = z 2 + z 4 . Note also that (in either periodic or aperiodic case), the uniqueness of the minimum on the 
circle |u| = 1 at u = 1 does not hold in general for the other roots of up{p{u)) = 1. 

4 There is a typo in [9] p. 697: the inequality direction is wrong. 

5 p(z) has nonnegative real coefficients and is thus increasing in a neighborhood of 0, i.e. on z S [0, +e]. p being analytic near 0, is continuous 
and for any i£R small enough, p(z) = x will therefore have a real positive root z x , and this root will be analytic in x. This is the root that we 
call "real positive". 

6 A power series p is said to be periodic if and only if there exists a power series q and an integer g > 1 such that p(z) = q(z 9 ). Equivalently, 
the gcd of the support (=the ranks of nonzero coefficients) of the power series p is g ^ 1. If this gcd g equals 1, then p is said to be aperiodic. 
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5 Asymptotic probability that restricted compositions have the same num- 
ber of parts 

Our motivation for including the results from [5] in the preceding section is the following theorem which considerably extends 
the main results of [BJ. We single out the case m — 2 since in some cases it has been already studied in the literature. 

5.1 Pairs of compositions 

Theorem 5.1 Let PcN. The probability that two random compositions with parts in V have the same number of parts 
is, asymptotically as n — > oo. 

C 

where the value of C is related to the constant K from Equation [#| namely: 

~ , (9) 

Before proving this theorem let us make some comments. 
Remarks and examples: 

(i) Some special cases were considered in [6j. They include unrestricted compositions (V = N), V = {1,2}, or more 
generally V = {a, b} with a, b relatively prime, compositions with all parts of size at least d (V = {n E N : n > d}), 
and compositions with all parts odd and at least d. The arguments of [BJ rely on the analysis of the asymptotics of the 
bivariate generating functions, which is sometimes difficult and does not seem to be easily amenable to the analysis 
in the case of a general subset V of positive integers. Our approach is much more probabilistic and relies on a local 
limit theorem for the number of parts in a random composition with parts in V '. This turned out to be a much more 
universal tool. 

(ii) To illustrate the principle behind our approach, consider the unrestricted compositions. As was observed in [11], in 
that case is distributed like 1 + Bin(n — 1, 1/2) random variable. Therefore, 

7r n = Pr(Bin(n - 1, 1/2) = Bin'(n - 1, 1/2)) 

where Bin and Bin' denote two independent binomial random variables with specified parameters. Since the second 
parameter is 1/2 we have 

Bin(n - 1, 1/2) = n - 1 - Bin(n - 1, 1/2). 

Therefore, by independence we get 

tt„ = Pr(Bin(ra - 1, 1/2) + Bin'(n - 1, 1/2) = n - 1). 

Finally, since 

Bin(n - 1, 1/2) + Bin' (n - 1, 1/2) = Bin(2(n - 1), 1/2), 
we obtain by Stirling's formula that 

\ n-1 ) 1 



Pr(Bin(2n-2,l/2) =n-l) 



2 2n-2 



This is consistent with (|9| (and with [6 ) as for unrestricted compositions p(z) = J2k>i zk ~ — z )> so that p — 
1/2, p'{z) = 1/(1 - z) 2 , and p"(z) = 2/(1 - zf which gives C = 1. 

(iii) Although the above argument may look very special and heavily reliant on the properties of binomial random vari- 
ables, our point here is that it is actually quite general. The key feature is that the number of parts (whether in 
unrestricted or arbitrarily restricted compositions) satisfies the local limit theorem of Gaussian type, and this is 



enough to asymptotically evaluate the probability in Theorem 5.1 



(iv) For another example, consider compositions of n into two parts, i.e. V = {a, b} with a, b relatively prime. Then 
Theorem 15. II holds with 

c=^ a + b ^ (10) 

2\a-b\y/p 1 ^ 
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where p is the unique root of z a + z b = 1 in the interval (0,1). In this case p(z) = z a + z b so that p'(z) 
and p"(z) = a(a — l)z a ~ 2 + b(b — l)z b ~ 2 . Thus, writing the numerator of ^ as 

p{P'{p)f /2 = ^{pP'ip)?' 2 = -^(ap a + bp b f'\ 

Vp Vp 



az" 



■bz 



6-1 



we only need to check that 
But 



P P"{p) + P'{p) - p{P'{p)f = (a - b) 2 p a + b -\ 
pP"(p) + P'(p) = a 2 p a ~ 1 + b 2 p b ^ 



(11) 



so that the left-hand side of (11) is 



a 2 p a-l , U 2J>-1 



b 2 p°- 1 - a 2 p 2 °- 1 - b 2 p 2b - x - 2abp a+b -\ 
Factoring and using p a + p b = 1, we see that this is 

aV _1 (l - P a ) + b 2 p b -\l - P b ) - 2abp a+b - 1 = (a- b) 2 p a+b ~ l : 

as claimed. 

When a — 1 and b = 2 we have the Fibonacci numbers relation so that p — (Vo — l)/2 and (10) becomes 

(p + 2^)3/ 2 1 5^ 
°~ 2p3/2 - 2 [1 + ZP) - 2 ' 

which agrees with Q above and also with the expression given in [3] (see equation (2.10) therein). However, in the 
case of general a and b, the value of C was given in the last display of Section 3 in as 



p(ap a ~ x +bp b - 1 ) 2 



v/4(a + b)p 2a + 2b ~ 2 + 2(1 - p 2a - p 2b )(ap 2a - 2 + bp 2b ~ 2 ) ' 



(12) 



This is incorrect as it is lacking a factor \a— b\ in the denominator (so that it gives the correct value of C when \a — b\ = 1 
but not otherwise). To see this and also to reconcile ( 12 ) with ( 10 ) (up to a factor \a — b\) we simplify ( 12 ) by noting 
that p a + p b = 1 implies 



i 2a 2b 

l-p - p 



1 - (p ) - p = (1 + p )p - p =p(l + p-p) = 2p 



2b 



so that the expression under the square root sign in (12) becomes 

4p a+b - 2 ((a + b)p a+b + ap 2a + bp 2b ) = 4p a+b - 2 (ap a + bp b ){p a + p b ). 



Using again p a 



1 ( 12 ) is seen to be 



p 2 (ap a - 1 + bp^ 1 ) 2 _ (ap a + bp b fl 2 



2^p a + b (ap a + bp b ) 2^J~pP 



which, except for the factor \a — b\ in the denominator, agrees with (10). 

(v) Other examples from [B] can be rederived in the same fashion, but we once again would like to stress universality 
of our approach. As an extreme example, we can only repeat after [9]: even if we consider compositions into twin 
primes, V = {3, 5, 7, 11, 13, 17, 19, 29, 31, . . . }, we know that the probability of two such compositions having the same 
number of parts is of order 1/y/n. This is rather remarkable, considering the fact that it is not even known whether 
this set V is finite or not. 



Proof of Theorem 1 5 . 1 [ This will follow immediately from the following lemma applied to X n = and formula ^ 
which gives the expression for a n . This lemma should be compared with a more general Lemma 6 of |5 . We include our 
proof to illustrate that seemingly very special arguments used in item (ii) are actually quite general. □ 

Lemma 5.2 Let (X n ) with EX n — p n and var(X„) — cr„ — > oo as n — » oo, be a sequence of integer valued random variables 
satisfying a local limit theorem (of Gaussian type) with speed e n as described in Definition 4-1 Let {X' n ) be an independent 
copy of {X n ) defined on the same probability space. Then 



7r„ = Pr(X„ = X' n ) 



2y/iro- n 



O 
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Proof. For X n and X' n as in the statement we have 

7T„ = Pr(X„ = X' n ) = Pr ( X « = k = X 'n) = Pf2 ( X " = fc ) 



fe>l 



k>l 



(13) 



Now, 



Pr(X„ = Lm«J + = Pr (^" = Lm«J - i) + { Pr(X n = [fi n \ +£)- Pr(X n = L/i„J - £)}. 
To estimate the term in the curly brackets take x+ and X- such that 

L^nJ + ^ = LA*" + x +°n\ , and Li"nJ - I = l^n - ^-0"nj ■ 

By elementary considerations, —2{[i n }/<7 n < x + — < 2(1 — {/in}) /oVi (where {z} is the fractional part of z), hence 
X-\ < 2/cr„. Then 



Pr(X„ = La*„J + 1) - Pr(X„ = L/i„J - = 



1 



V2 



7TOY, 



Pr(X„ = [fx n + z+0"nj) 



27T(T r , 



Pr(Jf„ = |_Mn - x-v n \) 



-x 2 _/2 



2na„ 



The absolute value of the second term is 



1 * 2 

a n Pr(X n = [[i n + x + a n \ ) — e 



^2vr 



< 



and similarly with the third term. Applying the inequality \f{x) — f(y)\ < \x — y\ sup |/'(f)| to the first term gives 

2 2 

e~T~ — e~T~ < |x+ — X-\ = 0(l/a n ) and so, the first term is 0(1/ct£). Therefore, 



= O 



£n , 1 



£« , 1 



Pr(X n = L/x»J + £) - Pr(X n = [fi n \ - i) 
Coming back to equation (13), we see that 

Pr(X n = [fi n \ + 1) ( Pr{X n = [fi n \ -£) + 

= ( J2 Pr(X n = [fi n \ +t,X' n = L/inJ -f) +1X0 

\i=-oo / 

= Pr(X„ + X' n = 2LMnJ ) + O ( — + \). 

Since X n + X' n is a sum of two i.i.d. random variables, it has mean 2/jL n and the variance 2a\. Furthermore, since each 
of the summands satisfies the local limit theorem of Gaussian type, so does the sum (its probability generating function 
is the square of f n (u) and thus falls into quasi-power category, just as f n (u) does). Since 2|_/XnJ = L^Mn + x\[2a n \ for 
some x = 0(l/o-„), just as before we have 

V2a n Pr(X n + X' n = l2fi n \) - 

Consequently, 







= (s n - 


-i) 







7r„ = Pr{X n = X' n ) = Pr(X n +X' n = [2^ n \) 
which completes the proof of Lemma |5.2| and of Theorem |5.1| 



1 



2y / 7T(7 n 



o 



□ 
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5.2 Tuples of compositions 

Here we sketch a proof of the following extension of Theorem |5.1 



Theorem 5.3 Let PcN and let m > 2 be fixed. Then, the probability 7r„ that m randomly and independently chosen 
compositions with parts in V all have the same number of parts is, asymptotically as n oo, 



where C m is related to the constant K from Equation^ namely: 

r _J 1 ( P 2 (P'(P)) 3 X(m ~ 1)/2 

G m = V K - 



V2 m - 1 m V2 m ~ 1 m \PP"(p) + P'{p) - p(p'(p)) 2 

Remark. For unrestricted compositions, the expression in the big parentheses is 2 (see (i) in Remarks above). This 
gives C m = y t /2 m ~ 1 /m as stated in Section 



3.1 



Proof of Theorem \5.3\ This follows immediately from the following statement which itself is a straightforward extension of 
Lemma 6 of [5] with essentially the same proof. We will be using it for Gaussian density in which case 

1 f°° ^rr^ 1 1 

g m (x)dx = —j= \ e 2 dx = 



^{2ir) m J-oo v / ( 27r ) m_1 V"» 

Lemma 5.4 (Bona-Flajolet). Let (X n ) be integer valued with fi n = EX n , <r„ = var(A n ) — > oo as n — > oo. Let g be the 
probability density function and suppose that 

lim sup | Pr(Jf„ = |_Mn + xa n \) - g(x)\ = 0. 

n— >oo ^ 

Let further (Xn), k = 1, . . . , m &e independent copies of the sequence (X n ) defined on the same probability space. Then 



<- 1 Pr(X«=4 2 )=---=X("'))^ / ^(arjda;, as n oo. 

•/ — OO 

To see this we just follow the argument in [5j Lemma 6] with obvious adjustments: the left-hand side above is 

oo oo 

a™" 1 ^(Xn = k) = air 1 E Pr ™( X « = 1^ + x k a n \), 

k=l k=l 

with < x k < k+1 -V" . This is further equal to 

— ^KPr(X„ = [fin + x k a n \)) m ~ — ^ 5 m (^) ~ — f ff m (^^)dx, 



where the first approximation holds by the assumption of the lemma (after having first restricted the range of x k s to a 
large compact set) and the second by the Riemann sum approximation of the integral. Since the expression on the right 
is /_ g m (x)dx, the result follows. □ 



6 Concluding remarks 

1. In this article, we restricted our attention to compositions (giving first several new closed-form formulas, and then 



going to the asymptotics), but it is clear that Lemma 5.4 can be applied to many combinatorial structures, e.g. the 



probability that m random permutations of size n have the same number of cycles (see |13j for the case m = 2), 
or the probability that m permutations have a longest increasing subsequence of the same length, or the probability 
that m random planar maps have a largest component of same size. This leads to interesting analytic/computational 
considerations, as it will involve evaluating the integral of g m (x) where g(x) will be the Tracy-Widom distribution 
density (provided the local limit theorem holds, which has not been proven yet), or the map- Airy distribution density 
(for which a local limit theorem was established, see [3]). 

A similar approach can be also applied to tuples of combinatorial structures following m different local limit laws 
(with m densities having fast decreasing tails), as long as they have the same mean. 
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When the means are not the same, the probability of the same number of parts is generally of much smaller order. 
This is because if X n has mean cn and X' n has mean c'n and both have linear variances, then assuming w.l.o.g. c > c! 
and choosing a < we note that if \X n — cn\ < an and \X' n — c'n\ < an then 

X n — X' n > cn — an — (c'n + an) — (c — c — 2a)n > 0, 

so that X n 7^ X' n . Therefore, 

7T„ = Pr(V„ = X' n ) < Pr(|X„ - cn\ > an) + Pr(\X' n - c'n\ > an). 

Since both X n and X' n converge to a Gaussian law and <r n = an, the first probability is roughly (with /3 = a/y/a) 

(\X n - cn\ n _\ 1 f°° t 2 /2 , 1 
Pr 1 - 1 > = / e"' /2 dt 



by the well-known bound on the tails of Gaussian random variables (see, e.g. [8| Chapter VII, Lemma 2]). This 
is consistent with an example discussed in Section |3.1| The difficulty with making this argument rigorous is that 
the error in the first approximation is usually of much bigger (typically l/^/n) magnitude than the quantities that 
are approximated. However, a slightly weaker bound, namely, e - ^™ (with a generally different value of /?) can be 
obtained by using Theorem IX. 15 in [S] which asserts that tail probabilities of random variables falling in the scheme 
of quasi-powers are decaying exponentially fast. While this theorem is stated for the logarithm of Pr(|V„ — cn\ > an), 
it is clear from its proof that one actually gets exponential bound on the tail probabilities (see Equation (88) on p. 701 
in [9] and a few sentences following it). 

The Gaussian local limit law explains the universality of the l/(-Kn)( m ~ 1%> l 2 appearance for numerous combinatorial 
problems in which we would force m combinatorial structures of size n to have an extra parameter of the same value. 



We also wish to point out yet another insight provided by the probabilistic approach. As we mentioned in Section |3T 
(see Footnote |2j, it allows to solve the connection constant problem intrinsic to the Frobenius method, and therefore, 
a combination of these two approaches (local limit law plus Frobenius method) gives access to full asymptotics in 
numerous cases. 
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